Search CORE

Kerrighed: A SSI Cluster OS Running OpenMP

Author: Berthou Jean-Yves
Lottiaux Renaud
Margery David
Morin Christine
Vallée Geoffroy
Publication venue: HAL CCSD
Publication date: 01/01/2003
Field of study

Writing parallel programs for clusters of workstations is still a challenging task. In this paper, we present Kerrighed, a Single System Image (SSI) operating system giving the illusion of an SMP machine, and providing the standard posix thread interface to developers. It is therefore possible to use Kerrighed to run OpenMP programs compiled for SMP-machines using the posix thread interface. In this paper, we explain how we managed to achieve that goal, and present the benefits of providing OpenMP support through the SSI approach as opposed to a dedicated run-time environment

CiteSeerX

Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters

Author: Berthou Jean-Yves
Lottiaux Renaud
Margery David
Morin Christine
Vallée Geoffroy
Publication venue: HAL CCSD
Publication date: 01/01/2005
Field of study

Process management mechanisms (process duplication, migration and checkpoint/restart) are very useful for high performance and high availability in clustering systems. The single system image approach aims at providing a global process management service with mechanisms for process checkpoint, process migration and process duplication. In this context, a common mechanism for process virtualization is highly desirable but traditional operating systems do not provide such a mecahnism. This paper presents a kernel service for process virtualization called ghost process, extending the Linux kernel. The ghost process mechanism has been implemented in the Kerrighed single system image based on Linux. \\ Les mécanismes de gestion de processus (duplication, migration et création de point de reprise/reprise de processus) sont particulièrement intéressants pour les systèmes pour grappes de calculateurs à haute performance et à haute disponibilité. L'un des buts des systèmes à image unique est d'offrir un service de gestion globale des processus fondé sur des mécamismes de création de points de reprise de processus, de migration de processus et de duplication de processus. Dans ce contexte, un mécanisme commun pour la virtualisation de processus est hautement bénéfique mais les systèmes d'exploitation traditionnels n'offrent pas un tel mécanisme. Ce document présente un service noyau pour la virtualisation de processus, appelé processus fantôme, fondé sur une extension du noyau Linux. Le mécanisme de processus fantôme a été mis en oeuvre dans le système à image unique Kerrughed fondé sur Linux

A New Approach to Configurable Dynamic Scheduling in Clusters based on Single System Image Technologies

Author: Berthou Jean-Yves
Morin Christine
Rilling Louis
Vallée Geoffroy
Publication venue: HAL CCSD
Publication date: 01/01/2003
Field of study

Clusters are now considered as an alternative to parallel machines to execute workloads made up of sequential and/or parallel applications. For efficient application execution on clusters, dynamic global process scheduling is of prime importance. Different dynamic scheduling policies that have been studied for distributed systems or parallel machines may be used in clusters. The choice of a particular policy depends on the kind of workload to be executed. In a cluster, it is thus highly desirable to implement a configurable global scheduler to be able to adapt the dynamic scheduling policy to the workload characteristics, to take benefit of all cluster resources and tocope with node shutdown and reboot. In this paper, we present the architecture of the global scheduler and the process management mechanisms of Kerrighed, a single system image operating system designed for high performance computing on clusters. Kerrighed provides a development framework allowing to easily implement dynamic scheduling policies without kernel modification. In Kerrighed, the global scheduling policy can be dynamically changed while applications execute on the cluster. Kerrighed's process management mechanisms allow to easily deploy parallelapplications in the cluster and to efficiently migrate or checkpoint processes, including processes sharing memory. Kerrighed has been implemented as a set of modules extending Linux kernel. Preliminary performance results are presented

Is Virtualization Killing Single System Image Research?

Author: Gallard Jérôme
Gallard Pascal
Lebre Adrien
Morin Christine
Scott Stephen,
Vallée Geoffroy
Publication venue: HAL CCSD
Publication date: 01/01/2007
Field of study

Nowadays, use of clusters in research centers or industries is undeniable. Cluster usage is typically based on two different models: (i) batch schedulers and (ii) single system image (SSI). In the first case, applications are scheduled by a ``supervisor'', the batch scheduler, according to cluster resources availability. In the second case, an SSI operating system (OS) gives the illusion that a distributed system is a standard SMP machine, allowing users to use standard UNIX tools to manage their applications. Even if SSI solutions are usually more complete in terms of functionality, batch schedulers are usually preferred because of their simplicity in term of both configuration and usage. Moreover, since few years, combining virtual machines and batch systems offer more advanced resource management capabilities, using features such as virtual machine live migration. Because of the latest contributions in the domain, some may argue that SSI technologies are now deprecated. In this paper, we analyze whether virtualization technologies will surpass the SSI approach, or if these two models are not contradictory but complementary. In fact, after evaluating different configurations, we show that by combining both approaches, we can improve several aspects associated to application computation such as flexibility of administration, simplicity of use, security and portability

Rare key functional domain missense substitutions in MRE11A, RAD50, and NBNcontribute to breast cancer susceptibility: results from a Breast Cancer Family Registry case-control mutation-screening study

Author: A di Masi
A di Masi
A Dzikiewicz-Krawczyk
A Hollestelle
A Marchler-Bauer
C Voegele
Catherine Voegele
CB Schiller
CF Bender
D Delia
David E Goldgar
DF Easton
DJ Park
EM John
Erin L Young
Esther M John
F Le Calvez-Kelm
F Le Calvez-Kelm
Fabienne Lesueur
Florence Le Calvez-Kelm
FM Boisvert
Francesca Damiola
G Luo
G Yeo
Gareth J Williams
Geoffroy Durand
GJ Williams
GJ Williams
HS Lim
I van der Burgt
IA Adzhubei
IM Wallace
Irene L Andrulis
J Bartkova
J Falck
J Felsenstein
J Lloyd
J Tommiska
J Zhu
JA Regal
Javier Oliver
John L Hopper
K Heikkinen
K Heikkinen
K Lammens
K Michailidou
Kayoko Tao
KP Hopfner
KP Hopfner
M van Puijenbroek
MA Adank
Maroulio Pertesi
Maxime P Vallée
Melissa C Southey
MP Vallee
N Uchisaka
Nathalie Forey
Nivonirina Robinot
P Bouwman
P Kumar
PE Huijts
R Waltes
RS Williams
RS Williams
S Garritano
Sean V Tavtigian
SV Tavtigian
SV Tavtigian
SV Tavtigian
T Nguyen-Dumont
T Usui
Terrell C Roane
U Dery
X Wang
Y Xiao
Z You
ZH Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A Framework for High Availability Based on a Single System Image

Author: Morin Christine
Scott Stephen,
Vallée Geoffroy
Publication venue: HAL CCSD
Publication date: 01/01/2005
Field of study

High availability (HA) is today an important issue in the domain of cluster computing, clusters being more and more larger, introducing a lot of failures. Today, the literature provides a lot of different HA strategies to tolerate application failures (applications being sequential or parallel). Unfortunately, it is still difficult to implement these HA policies inside a real system, and therefore the study of these policies is most of the time just theoretic, without real implementation. Therefore, a framework to ease the implementation of such policies is interesting. Moreover, a single system image (SSI), thanks to mechanisms for the global management of cluster resources, is a good candidate to provide such a framework. This paper presents the preliminary study of this framework on top of the Kerrighed SSI